Neptune: a bioinformatics tool for rapid discovery of genomic variation in bacterial populations
نویسندگان
چکیده
The ready availability of vast amounts of genomic sequence data has created the need to rethink comparative genomics algorithms using 'big data' approaches. Neptune is an efficient system for rapidly locating differentially abundant genomic content in bacterial populations using an exact k-mer matching strategy, while accommodating k-mer mismatches. Neptune's loci discovery process identifies sequences that are sufficiently common to a group of target sequences and sufficiently absent from non-targets using probabilistic models. Neptune uses parallel computing to efficiently identify and extract these loci from draft genome assemblies without requiring multiple sequence alignments or other computationally expensive comparative sequence analyses. Tests on simulated and real datasets showed that Neptune rapidly identifies regions that are both sensitive and specific. We demonstrate that this system can identify trait-specific loci from different bacterial lineages. Neptune is broadly applicable for comparative bacterial analyses, yet will particularly benefit pathogenomic applications, owing to efficient and sensitive discovery of differentially abundant genomic loci. The software is available for download at: http://github.com/phac-nml/neptune.
منابع مشابه
Neptune: A Tool for Rapid Genomic Signature Discovery
Neptune locates genomic signatures using an exact k -mer matching strategy while accommodating k -mer mismatches. The software identifies sequences that are sufficiently represented within inclusion targets and sufficiently absent from exclusion targets. The signature discovery process is accomplished using probabilistic models instead of heuristic strategies. We have evaluated Neptune on Liste...
متن کاملNeptune: A Tool for Rapid Microbial Genomic Signature Discovery
Neptune locates genomic signatures using an exact k -mer matching strategy while accommodating k -mer mismatches. The software identifies sequences that are sufficiently represented within “inclusion targets” and sufficiently absent from “exclusion targets”. The signature discovery process is accomplished using probabilistic models instead of heuristic strategies. We have evaluated Neptune on L...
متن کاملRapid Detection of Campylobacter jejuni by Polymerase Chain Reaction and Evaluation of its Sensitivity and Specificity
Introduction: Campylobacter jejuni is one of the most common causes of food poising in humans. Rapid and specific detection of these bacteria has an important role in diagnosis and treatment of infection. The aim of this study was to design a specific PCR for the detection of Campylobacter jejuni. Methods: In this experimental study, oxidoreductase gene from the Campylobacter jejuni was sele...
متن کاملRapid DNA extraction of bacterial genome of Staphylococcus aureus using laundry detergents and assessment of the efficiency of DNA in downstream process using PCR
Abstract Background and objectives: Genomic DNA extraction of bacterial cells is of processes performed normally in most biological laboratories therefore, various methods have been offered, manually and kit, which may be time consuming and costly. In this paper, genomic DNA extraction of Staphylococcus aureus was investigated using some laundry detergent brands available in Iran to achieve ...
متن کاملThe Natural Variation in Six Populations of Calendula officinalis L.: A Karyotype Study
In the current investigation, karyotype analysis and chromosome characteristics of six populations of Calendula officinalis L.(pot marigold) from Iran are studied. Results showed that all populations were diploid (2n= 2x= 32), and had symmetrical karyotypes composing mainly of metacentric and submetacentric chromosomes. The mean chromosome length ranged from 1.05 in Karaj to 1.50 μm in...
متن کامل